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ABSTRACT Elongation factor P (EF-P) is a universally conserved bacterial translation factor homologous to eukaryotic/archaeal 
initiation factor 5A. In Salmonella, deletion of the efp gene results in pleiotropic phenotypes, including increased susceptibility 
to numerous cellular stressors. Only a limited number of proteins are affected by the loss of EF-P, and it has recently been deter- 
mined that EF-P plays a critical role in rescuing ribosomes stalled at PPP and PPG peptide sequences. Here we present an unbi- 
ased in vivo investigation of the specific targets of EF-P by employing stable isotope labeling of amino acids in cell culture (SI- 
LAC) to compare the proteomes of wild-type and efp mutant Salmonella. We found that metabolic and motility genes are 
prominent among the subset of proteins with decreased production in the Ae/p mutant. Furthermore, particular tripeptide mo- 
tifs are statistically overrepresented among the proteins downregulated in efp mutant strains. These include both PPP and PPG 
but also additional motifs, such as APP and YIRYIR, which were confirmed to induce EF-P dependence by a translational fusion 
assay. Notably, we found that many proteins containing polyproline motifs are not misregulated in an EF-P-deficient back- 
ground, suggesting that the factors that govern EF-P-mediated regulation are complex. Finally, we analyzed the specific region of 
the PoxB protein that is modulated by EF-P and found that mutation of any residue within a specific GSCGPG sequence elimi- 
nates the requirement for EF-P. This work expands the known repertoire of EF-P target motifs and implicates factors beyond 
polyproline motifs that are required for EF-P-mediated regulation. 

IMPORTANCE Bacterial cells regulate gene expression at several points during and after transcription. During protein synthesis, 
for example, factors can interact with the ribosome to influence the production of specific proteins. Bacterial elongation factor P 
(EF-P) is a protein that facilitates the synthesis of proteins that contain polyproline motifs by preventing the ribosome from 
stalling. Bacterial cells that lack EF-P are viable but are sensitive to a large number of stress conditions. In this study, a global 
analysis of protein synthesis revealed that EF-P regulates many more proteins in the cell than predicted based solely on the prev- 
alence of polyproline motifs. Several new EF-P-regulated motifs were uncovered, thereby providing a more complete picture of 
how this critical factor influences the cell's response to stress at the level of protein synthesis. 
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Regulation of gene expression can occur at many steps en route 
from a DNA-encoded gene to a mature protein. Protein syn- 
thesis can be controlled by self-mediated stalling of a peptide 
chain that exerts a regulatory effect on the translation of the pro- 
tein or that of downstream genes. For example, the SecM tran- 
script stalls its own translation in a nascent polypeptide- 
dependent manner via a series of complex interactions with the 
ribosome (1,2). This stall leads to remodeling of an mRNA hair- 
pin, thereby revealing the ribosome binding site of the down- 
stream secA gene, which results in the increased production of 
SecA (3). SecA in turn forms a feedback loop by targeting the 
SecM-stalled ribosome to the protein export machinery, which 
allows translation of SecM to resume (2, 4). Though a number of 
proteins that mediate their own translational stalls have been 
characterized, each is unique, and common sequence motifs have 



not been well defined, although certain themes have emerged (5). 
Stalling can occur, for example, if the transcript contains an inter- 
nal sequence resembling a Shine-Dalgarno sequence (6). Interac- 
tion of the nascent polypeptide with the ribosomal exit tunnel and 
peptidyl transferase center can also alter the ribosome such that if 
the A-site aminoacyl-tRNA is a poor acceptor for the particular 
P-site peptide, peptide bond formation will be stalled (5, 7). Fi- 
nally, studies have identified C-terminal sequences capable of 
blocking termination at stop codons by inhibiting the action of 
ribosomal release factors (5, 8, 9). 

Elongation factor P (EF-P) is a universally conserved bacterial 
translation factor that was originally characterized for its ability to 
stimulate peptide bond synthesis in vitro. In Escherichia coli and 
Salmonella, EF-P undergoes a posttranslational modification ho- 
mologous to hypusination of its eukaryotic homolog, eukaryotic 
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FIG 1 A subset of proteins is significantly misregulated in Aefp Salmonella. The histogram outlines the 
distribution of protein synthesis ratios identified in SILAC. Columns indicate the number of proteins 
with an average synthesis ratio between two neighboring x axis values. Underlined values in the x axis 
indicate a change in scale. The inset table shows the number of SILAC hits demonstrating a greater than 
2-fold difference in protein level between the efp + (WN1269) and Aefp (WN1308) strains. The second 
column further indicates the proteins with a significance B value of less than 0.01 in at least one trial. 
"Total" indicates the number of proteins identified in at least one replicate regardless of expression ratio, 
and the second column of this row includes proteins with any expression ratio that had a significance B 
value of less than 0.0 1 in at least one trial. Synthesis ratios shown are the average normalized heavy/light 
ratios of three biological replicates. 



translation initiation factor 5A (eIF5A), wherein a unique /3-lysine 
residue is added to a conserved lysyl residue by the combined 
activities of PoxA and YjeK (the PYE pathway). The EF-P protein 
has a shape similar to that of a tRNA and interacts with the ribo- 
some in a unique position contacting the E site of the 30S subunit 
and the P site on the ribosome. The jSTysylated residue of EF-P is 
oriented in the ribosome such that it projects into the peptidyl 
transferase center (PTC), presumably to modulate peptide bond 
formation. We and other laboratories have noted that deletion of 
any of the PYE genes in Salmonella results in pleiotropic pheno- 
types, including increased susceptibility to a range of cellular stres- 
sors, as well as motility defects and impaired nutrient utilization 
relative to findings for the wild-type strain (10-13). It has further 
been shown that /3-lysylated EF-P also undergoes hydroxylation 
by YfcM, but this additional modification does not appear to be 
critical for EF-P function based on phenotypic and in vivo analyses 
(14, 15). 

Proteomic analysis of Salmonella poxA and efp mutants and 
Agrobacterium efp (chvH) mutants suggested that EF-P acts in a 
targeted manner rather than as a global translation factor (10, 11, 
13, 16). Recent work has demonstrated that EF-P rescues the 
translation of elongating polypeptides that are stalled at particular 
tripeptide motifs, in particular PPP or PPG (9, 17, 18). EF-P was 
recently shown to not have an effect at other types of stalling 
motifs, such as the C-terminal motifs that inhibit release factors 
(9). The discovery of EF-P target motifs lends insight into why 
specific proteins are subject to EF-P-mediated regulation. Fur- 
thermore, the pleiotropic phenotypes exhibited by bacteria lack- 
ing EF-P could likely be explained by the presence of polyproline 
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stretches in factors essential for the phe- 
notype. For instance, a number of flagel- 
lar genes contain such motifs, and their 
misregulation likely contributes to the 
motility defect observed in PYE mutants 
(10, 13). However, previous work has fo- 
cused on in vitro study of particular EF-P 
targets and has not thoroughly evaluated 
the role of EF-P in vivo. 

Here we present an unbiased in vivo 
comprehensive analysis of the Salmonella 
proteins affected by EF-P, using stable- 
isotope labeling (SILAC) (19, 20). While 
we confirmed that several proteins con- 
taining polyproline motifs are subject 
to EF-P-mediated regulation, we have 
found that the presence of polyproline 
tracts is not sufficient to predict EF-P de- 
pendence, since many proteins contain- 
ing such motifs are unaffected by the 
absence of EF-P. We also identified alter- 
nate target sequences in addition to PPP 
and PPG that render peptide synthesis 
dependent on EF-P. Finally, we investi- 
gated the determinants of EF-P depen- 
dence for pyruvate oxidase (PoxB), which 
was previously shown to be downregu- 
lated approximately 8-fold in E. coli 
strains lacking PoxA despite the absence 
of a polyproline sequence (21). 

RESULTS 

Identification of EF-P-regulated proteins by SILAC. We previ- 
ously examined the proteome of the Salmonella enterica poxA mu- 
tant using two-dimensional difference gel electrophoresis (2D- 
DIGE). In that study, total cellular proteins from the wild type and 
poxA mutants were labeled with fluorescent dyes, mixed in equal 
amounts, and then separated by 2D-PAGE prior to analysis (11). 
Using this technique, we determined that a relatively small subset 
of proteins were affected by perturbations in the PYE pathway — a 
finding in agreement with earlier work on the efp mutant of Agro- 
bacterium (16). However, we were able to unambiguously identify 
only a small number of these proteins by mass spectrometry due to 
crowding on the 2D gel. 

To gain a more comprehensive view of the effect of EF-P on 
protein levels, we employed stable isotope labeling of amino acids 
in cell culture (SILAC) in conjunction with quantitative mass 
spectrometry-based proteomics to examine the proteome of an 
efp mutant strain of Salmonella enterica serovar Typhimurium 
strain SL1344 (strain WN1308). We examined the profiles of three 
biological replicates and in total were able to detect, quantify, and 
identify a total of 1,517 proteins, or approximately 34% of the 
4,514 proteins predicted to be encoded in the S. Typhimurium 
strain SL1344 genome (Fig. 1). To identify proteins showing al- 
tered levels, we first selected candidates that showed a significant 
difference between the wild-type and efp mutant strains in at least 
one of the three biological replicates with a significance B (an 
internal significance score as described in Materials and Methods) 
cutoff of 0.01. By this criterion, 87 proteins showed changes of 
2-fold or greater and 28 displayed a change of greater than 10-fold. 
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TABLE 1 Bioinformatic analysis predicts EF-P-regulated tripeptide motifs" 



Motif* 


Occ Mgh 


ExpMgh 


Occ mid 


Exp mid 


Occ low 


Expi ow 


Enrich^ 


Enrich low 


P value' 


PPP 


18 


4.96 


15 


29.1 


o 


3.16 


4.09 


0.00 


2. IE— 10 


PPG 


28 


8.48 


26 


49.7 


1 


5.40 


3.82 


0.21 


5.6E— 14 


APP 


18 


11.1 


22 


65.1 


0 


7.07 


3.38 


0.00 


7.6E-10 


RME 


12 


5.30 


16 


31.1 


2 


3.38 


3.00 


0.79 


1.3E-04 


YIR 


10 


4.64 


14 


27.2 


1 


2.96 


3.00 


0.47 


4.7E-04 


TQM 


11 


3.64 


17 


21.4 


0 


2.32 


2.95 


0.00 


1.4E-04 


PFF 


10 


2.84 


15 


16.6 


2 


1.81 


2.78 


0.87 


7.4E-05 


DPP 


5 


6.73 


8 


39.4 


1 


4.29 


2.68 


0.84 


2.0E-07 


FFL 


5 


6.15 


8 


36.1 


1 


3.92 


2.68 


0.84 


1.4E-06 


QNA 


22 


10.0 


43 


58.8 


5 


6.39 


2.36 


0.84 


3.0E-05 



" Prevalence among the SILAC hits of all possible tripeptide combinations of the 20 common amino acids. Occ, motif occurrences in the indicated subgroup of the SILAC data; 
Exp, expected motif occurrences in the indicated subgroup based on amino acid prevalence; Enrich, calculated motif enrichment index in the indicated subgroup of the SILAC data; 
high, low, and mid, subgroups of SILAC data encompassing the 10% of proteins with the highest (high) or lowest (low) WN1269/WN1308 ratios; "mid" incorporates the remaining 
80%. 

b Motifs shown are those that demonstrated the greatest enrichment index within the 10% of proteins most downregulated in the efp mutant (highest WN1269/WN1308 ratio). 
c P values were calculated using the chi-square test with Yates' correction as described in Materials and Methods. 



Of the 87 significantly misregulated proteins, 49 showed de- 
creased steady-state levels in the efp mutant strain and are more 
likely to be direct targets of EF-P owing to its characterized stim- 
ulatory effect on translation (17, 18). 

Identification of amino acid motifs enriched in EF-P- 
regulated proteins. To determine if misregulation of specific pro- 
teins in efp mutants can be attributed to certain motifs, we ana- 
lyzed the SILAC data for the prevalence of specific peptide 
sequences in proteins showing a strong decrease or increase in the 
Salmonella efp mutant. Since tripeptide motifs (PPP and PPG) had 
previously demonstrated EF-P-dependent translation, we 
searched for the frequency of each possible tripeptide motif in the 
10% of proteins most strongly affected (either up or down) by 
EF-P. We compared the actual frequency of each motif to its ex- 
pected frequency and the frequency observed in the remaining 
90% of proteins identified in SILAC. An enrichment score was 
calculated based on the ratio between observed occurrences of a 
tripeptide within this group, normalized against the actual-over- 
expected occurrences in all retrieved SILAC hits (see Materials and 
Methods). Using this method, we identified a number of tripep- 
tide motifs that were enriched in the 10% of proteins that were 
most downregulated in the efp mutant (Table 1). The two top- 
scoring tripeptide sequences identified were PPP and PPG, sup- 
porting the recent findings regarding EF-P regulation and also 
acting as a proof of principle for our analysis. Additionally, a num- 
ber of previously uncharacterized sequence motifs were also iden- 
tified, including the third-highest scoring sequence, APP, impli- 
cating them as potential EF-P-dependent motifs. While this article 
was in preparation, Woolstenhulme et al. identified the APP- 
containing motifs MRAPP and WAPP as sequences capable of 
stalling ribosomes (9). 

Alternative non-proline-containing motifs can render pro- 
tein synthesis dependent on EF-P. Results from the motif analysis 
lend support to the previously characterized EF-P-dependent se- 
quences but also propose novel motifs that may require EF-P for 
efficient translation. To assess the possibility that EF-P plays a role 
in the translation of motifs other than polyproline, we constructed 
a number of translational fusion plasmids wherein putative target 
sequences were inserted in-frame at the fourth codon ofgfp. Green 
fluorescent protein (GFP) fluorescence was significantly de- 
creased in an efp mutant relative to that in wild-type E. coli when 



any of the PPP, PPG, or APP motifs were inserted into GFP (Ta- 
ble 2). Interestingly, insertion of a YIRYIR motif also caused pro- 
duction of the GFP reporter to be dependent on EF-P. This is the 
first EF-P target motif identified that lacks a proline residue. In 
contrast, none of the other predicted motifs — including an unre- 
pealed YIR sequence — yielded a significant difference in synthe- 
sis. We further analyzed the proteins containing non-EF-P- 
dependent motifs to determine the reasons they were enriched in 
our bioinformatic analysis. Of the 10% of proteins that were most 
downregulated in the efp mutant, 16 of the 29 proteins containing 
an RME, PFF, or YIR motif also have a PPP, PPG, or APP motif, 
which may explain their EF-P dependence. For the 13 proteins 
that contain an RME, PFF, or YIR motif but lack a PPP, PPG, or 
APP motif, we aligned the residues upstream of the putative reg- 
ulatory motif but did not identify any conserved sequences (data 
not shown). The underlying cause of the EF-P dependence of 
these proteins remains to be investigated. 

The presence of a PPP, PPG, or APP motif is not sufficient to 
confer dependence on EF-P. To investigate whether the charac- 



TABLE 2 Verification of predicted EF-P-dependent motifs' 1 



Motif 


GFP fluorescence 
{WT/Aefp) b 


Null c 


0.95 ± 0.09 


PPPPPP 0 << 


20.68 ± 0.38 


PPPPPP/ 


18.55 ± 0.38 


PPPPPP/ 


18.12 ± 0.20 


PPPPPP 3 < ; 


15.97 ± 0.08 


PPP 


4.72 ± 0.23 


PPG 


10.39 ± 0.66 


APP 


5.88 ± 0.20 


RME 


1.34 ± 0.07 


YIR 


0.96 ± 0.13 


YIRYIR 


7.61 ± 2.54 


PFF 


1.58 ± 0.19 



" Motifs were assayed for EF-P dependence by insertion into the 4th codon position of 
GFP. 

b Values at 21 h postinduction were normalized to cotranscriptionally expressed 
mCherry and are shown as a ratio of wild-type and efp mutant strains expressing the 
same construct. All values are the averages ± standard deviations for three biological 
replicates 

c No motif inserted into GFP. 

d Sequence of six optimal {CCG; 0) or random {1 to 3} proline codons. 
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FIG 2 Comparison of proteins identified in SILAC and those with EF-P target motifs. (A) Venn diagram outlining the overlap of: proteins conclusively 
identified in SILAC, proteins significantly downregulated in WN1308, and proteins containing a PPP, PPG, or APP motif as annotated in the Salmonella serovar 
Typhimurium strain SL1344 genome (GenBank accession no. FQ312003.1). (B) DAVID analysis of SILAC hits showing a decreased protein level in the Ae/p 
strain (ratio > 2) and a significance B score of less than 0.01 in at least one trial. Functional annotation clusters showing significant overrepresentation (P value 
< 0.05) are shown. Cluster breakdown can be found in Table S4 in the supplemental material. (C) DAVID analysis showing the most significantly overrepre- 
sented clusters among the 422 proteins that contain an EF-P target motif. For clarity, only groups with P values < 0.001 are shown here. For a full list, see Table S5. 
(D) DAVID analysis showing the only significantly overrepresented cluster (P value < 0.05) among the 20 proteins that fall into all three categories (contain an 
EF-P target motif and were significantly downregulated in the Ae/p strain in SILAC). Cluster breakdown can be found in Table S6 in the supplemental material. 



terized target motifs are sufficient for EF-P regulation in vivo, we 
compared the existence of a PPP, PPG, or APP motif in proteins 
with their SILAC ratios. Of the 422 proteins that contain one of the 
three target motifs, 100 were conclusively identified in SILAC 
(Fig. 2A). Only 20 of these were found to be significantly down- 
regulated in the efp mutant strain. A more conservative analysis of 
proteins with SILAC ratios of less than 2 (without regard for the 
significance score) found that there were 45 proteins identified in 
SILAC that were not downregulated in WN1308 yet have a PPP, 
PPG, or APP motif (12, 22, and 16 proteins, respectively, contain 
each motif; 5 proteins have two of the three motifs). This demon- 
strates that a large percentage of proteins that contain a character- 
ized EF-P target motif are not misregulated in the efp deletion 
strain. Examples include ZipA, SseA, and YtfM, which were iden- 
tified to have average expression ratios of 1.07, 0.74, and 0.84, 
respectively, with significance B scores that were far from signifi- 
cant in all three replicates (see Table S2 in the supplemental ma- 
terial). Though they were not misregulated in the efp mutant, 
Salmonella ZipA has two distinct APP motifs, as well as an APPP 
motif, SseA has an APPG motif, and YtfM contains a PPP motif. 



Taken together, these data suggest that the presence of particular 
tripeptide motifs may not, in and of itself, be sufficient to cause 
translational stalling in the absence of EF-P. 

To investigate whether the position of a target motif within a 
protein affected EF-P regulation, we examined the distribution of 
the proteins where the target motif occurs in the first 50 residues 
(early), where it occurs between residues 50 and 200 (middle), and 
where the motif occurs after the 200th residue (late). For the 
group of 20 target motif-containing proteins that were signifi- 
cantly downregulated in the efp mutant, we found that 30%, 40%, 
and 30% of proteins fall into the early, middle, and late categories, 
respectively. This is comparable to the 80 motif-containing pro- 
teins that were not significantly downregulated, for which we find 
that 25%, 39%, and 36% of proteins are grouped into the early, 
middle, and late categories, respectively. We also examined se- 
quence context by aligning the motif region of a number of EF-P- 
dependent and EF-P-independent motif-containing proteins. Re- 
sults from this analysis did not conclusively identify any upstream 
sequences consistent with EF-P dependence or independence 
(data not shown). 
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EF-P disproportionately affects synthesis of signaling and 
nucleotide-binding/metabolic proteins. To examine whether 
EF-P affected the synthesis of all classes of proteins similarly or if 
there was a particular bias toward a specific subset of proteins, the 
SILAC data were subjected to a functional annotation analysis 
using the Database for Annotation, Visualization and Integrated 
Discovery (DAVID) software package (22-24). The program 
compares a list of genes with functional annotation databases, 
including GO (gene ontology) terms, KEGG pathways, and SP- 
PIR (Swiss-Prot and Protein Information Resource) keywords, 
among others. By comparing the prevalences of proteins belong- 
ing to these categories in their respective database and in the input 
gene list, DAVID generates a P value highlighting significantly 
overrepresented annotation terms. 

Upon analysis of the 49 significantly downregulated proteins 
identified in SILAC, we find that four clusters demonstrate over- 
representation, with a cluster P value of less than 0.05 (Fig. 2B; see 
also Table S4 in the supplemental material). Most prominent 
among these terms were two-component regulatory systems, with 
particular emphasis on proteins involved in chemotaxis and mo- 
tility. Furthermore, metabolic proteins were also abundantly 
downregulated as annotated by functions in nucleotide binding, 
oxidative phosphorylation, or proteolysis. 

We subsequently examined the S. Typhimurium SL1344 ge- 
nome for proteins containing EF-P-dependent sequences and 
found 422 open reading frames (ORFs) encoding a PPP, PPG, or 
APP motif. These motifs occurred in 112 (PPP), 195 (PPG), and 
185 (APP) proteins, with 70 of those proteins containing more 
than one of these motifs. In contrast, the sequence YIRYIR is not 
present in any Salmonella protein. To examine the physiological 
processes involving proteins with EF-P-dependent motifs, we 
conducted a DAVID cluster analysis and found that the overrep- 
resented functional groups were similar to those identified among 
proteins that were significantly downregulated in SILAC (Fig. 2C) . 
Many of the identified proteins with a PPP/PPG/APP motif are 
predicted membrane proteins, a group we previously investigated 
by comparative gel electrophoresis. We previously identified the 
outer membrane porin, KdgM, as significantly upregulated in the 
efp mutant, which we observed to occur at the level of transcrip- 
tion (13). The SILAC analysis confirms KdgM as a protein that is 
38-fold more abundant in the efp mutant (see Table S2 in the 
supplemental material). However, the SILAC data did not identify 
any EF-P-dependent regulators (e.g., transcriptional repressors) 
that would easily explain why certain proteins, such as KdgM, are 
observed to have higher levels in the efp mutant strain. 

Twenty proteins that both contained a target motif and were 
significantly downregulated in SILAC were identified. Of these, 7 
were identified as nucleotide binding proteins — the only overrep- 
resented category in this subset of 20 proteins (Fig. 2D). Some of 
these proteins are involved in central metabolism. AtpD, the cat- 
alytic subunit of the F 0 F 1 ATPase, had previously been found to be 
downregulated in the proteome oipoxA mutants (11). PfkB is a 
phosphofructokinase that functions in glycolysis. Three of the 
other EF-P-dependent nucleotide binding proteins (HflB/FtsH, 
HslU, and ClpB) play a role in protein stability and turnover. The 
function of the polyproline motifs in these proteins is unlikely to 
be universally conserved. For example, in AtpD (25), ClpB (26), 
and PfkB (PDB identifier: 3UMP), the putative EF-P-dependent 
motifs are not proximal to the region of the protein that interacts 



with ATP, whereas in HflB/FtsH, the putative EF-P-dependent 
motif (GPPG) makes contact with AMP (27). 

Some proteins identified as EF-P dependent by SILAC or by the 
presence of a polyproline motif can provide a parsimonious ex- 
planation for previously described phenotypes of Salmonella 
strains lacking PoxA or EF-P. We observed, for example, that the 
enzyme gamma-glutamyl transferase (Ggt) is present at a level 
approximately 16-fold lower in the efp mutant than in wild-type 
Salmonella (see Table S2 in the supplemental material). Ggt con- 
tains a short polyproline motif (residues 291 to 293), and its 
strongly reduced synthesis in mutant cells likely explains why Sal- 
monella strains lacking EF-P, PoxA, or YjeK are simultaneously 
unable to utilize y-glutamyl-glycine as a nitrogen source and are 
resistant to the compound S-nitrosoglutathione (GSNO) (13, 28). 
The misregulation of proteins involved in motility and che- 
motaxis, such as CheA, which is downregulated approximately 
4-fold in the efp mutant, likely contributes to the observed motil- 
ity defect in poxA, yjeK, and efp mutant strains (13). 

Effect of EF-P on synthesis of PoxB and identification of the 
putative regulatory motif. Pyruvate oxidase (PoxB) was impli- 
cated as a protein dependent on EF-P for its synthesis when it was 
noted that levels of this enzyme were decreased 5- to 8-fold in 
E. coli poxA mutants (21, 29). The PoxB protein, however, lacks 
any of the previously described EF-P-dependent motifs, which 
leaves open the question as to whether the PoxB protein is indeed 
a target of EF-P and, if so, what particular sequence renders it 
dependent on EF-P for its synthesis. We were unable to detect 
PoxB in our SILAC assay due to the particular growth conditions 
employed. To circumvent this issue, we constructed a reporter 
vector wherein the PoxB 5' untranslated region (UTR) and open 
reading frame were fused to a "superfolder" variant of green flu- 
orescent protein (sfGFP). Production of the chimeric protein was 
driven from a constitutive tet promoter, which enabled us to mea- 
sure translation independently of transcription. In a Salmonella 
background, the full-length PoxB protein fused to sfGFP showed a 
marked and reproducible dependence on EF-P for synthesis 
(Fig. 3) despite the fact that transcription of this chimeric con- 
struct was similar between the mutant and wild-type strains as 
measured by quantitative real-time reverse transcriptase PCR (see 
Fig. SI in the supplemental material). Interestingly, though simi- 
lar poxB transcript levels indicate that transcription is not altered, 
gfp mRNA levels were decreased in strains expressing EF-P- 
dependent constructs. However, since previous in vitro data have 
shown EF-P rescuing ribosome stalls in the absence of nucleases 
(9, 17, 18), we believe that this degradation is not the cause of 
decreased GFP fluorescence but rather is an effect of ribosome 
stalling at EF-P-dependent motifs leading to reduced ribosome- 
mediated nuclease protection of the 3' end of the poxB-gfp tran- 
script. 

Decreased fluorescence was observed not only in an efp mutant 
but also in poxA and yjeK mutants (see Fig. S2 in the supplemental 
material). In contrast, a strain lacking the YfcM protein did not 
show any significant difference in synthesis of the PoxB-sfGFP 
fusion construct. These results are consistent with previous data 
showing that YfcM-mediated hydroxylation of EF-P is not critical 
for its function (14). 

To identify the region essential for EF-P dependence, we sys- 
tematically generated C-terminal truncations of the poxB se- 
quence fused to sfGFP (Fig. 3A). By comparing fluorescence of 
these constructs in wild-type and efp mutant Salmonella, we nar- 
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FIG 3 The GSCGPG motif of PoxB renders it dependent on EF-P. (A) Outline of translational fusion constructs expressing full-length (FL) PoxB or C-terminal 
truncations fused to sfGFP. Construct designations are shown in bold at left as a figure key and indicate the length in codons of each truncated poxB gene. The 
arrow indicates the transcriptional start site under the control of the PLtetO- 1 constitutively active promoter, and "fMet" indicates the ATG start codon of PoxB. 
Numbers indicate poxB lengths in codons. For clarity, only a selection of constructs is illustrated. (B) Sequence of amino acids 69 to 82 of PoxB. The GSCGPG 
motif is bold and underlined. (C) Relative GFP fluorescence of PoxB truncations. Values were taken at 10 h postinoculation, were normalized to the optical 
density at 600 nm (OD 600 ), and are shown relative to those of the wild-type strain expressing the same construct. The same plasmid encoding the first 186 codons 
of lacZ instead of poxB is included as a control. (D) Relative GFP fluorescence (as in panel C) for single-residue mutations to leucine in the full-length PoxB 
construct. All values are the averages for at least 3 biological replicates. Each error bar shows 1 standard deviation. 



rowed the critical region to a 14-amino-acid stretch between res- 
idues 69 and 82 with the sequence VCAGSCGPGNLHLI (Fig. 3B 
and C). We confirmed that the amino acid (as opposed to the 
nucleotide) sequence in this region was critical for EF-P depen- 
dence by generating a construct with this region shifted to the + 1 
reading frame and observing an abolition of the requirement for 
EF-P (see Fig. S3 in the supplemental material). 



Residues 69 to 82 of PoxB contain a GPG motif that we hypoth- 
esized could, like PPP, mediate the EF-P dependence of the PoxB 
protein. To address this, we performed site-directed mutagenesis 
on the GPG motif in the full-length PoxB-sfGFP construct and on 
additional upstream and downstream residues, converting each 
residue to a leucine. Leucine was chosen because of its absence 
from the top-scoring motifs predicted by our analysis of proteins 
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downregulated in the efp mutant (Table 1). We found that muta- 
tion of any residue in the GPG motif or in the upstream three 
residues (sequence GSC) restored GFP fluorescence in the efp mu- 
tant (Fig. 3D). In contrast, mutation of residues either down- 
stream or upstream of the GSCGPG motif had negligible effects on 
the synthesis of the reporter construct. Notably, the GSCGPG se- 
quence is found only in PoxB in Salmonella. These data elucidate a 
novel EF-P-dependent motif that, like the artificially derived 
YIRYIR, is considerably larger than the previously described trip- 
eptide motifs and can cause a protein to be dependent on EF-P for 
proper synthesis in vivo. 

DISCUSSION 

The recent finding that EF-P alleviates stalls caused by polyproline 
motifs during translational elongation provides a unifying mech- 
anism for the physiological consequences that are triggered by the 
loss of this factor. Here we conclusively identified and assessed the 
steady-state levels of more than 1,500 Salmonella proteins using 
SILAC and found that less than 6% were significantly misregu- 
lated by more than 2-fold in a strain lacking EF-P. A functional 
annotation analysis of the corresponding downregulated genes 
revealed that proteins involved in responding to environmental 
stimuli, motility, metabolism, and proteolysis were overrepre- 
sented among the group affected by the loss of EF-P. These data 
lend insight into the causes underlying the pleiotropic phenotypes 
exhibited by efp mutants by implicating particular pathway com- 
ponents. For instance, the finding that two-component systems 
were the most highly overrepresented group among the down- 
regulated proteins suggests that impaired responses to external 
stimuli contribute to the observed sensitivity of the efp mutant to 
a range of pharmacologically unrelated antibiotics. Determining 
the specific defective pathways in future studies may point to po- 
tential targets for novel antimicrobials that would act synergisti- 
cally with existing drugs. 

Following the discovery that EF-P rescues ribosomes stalled at 
particular tripeptide motifs, we probed the proteomic data for an 
abundance of such sequences in downregulated proteins. We 
found that in addition to the characterized PPP and PPG motifs, 
APP was also prominent among downregulated proteins, suggest- 
ing it as a novel EF-P target, which was confirmed using a trans- 
lational fusion assay. In addition to the discovery of APP as an 
EF-P dependent motif, our SILAC data have also led to the iden- 
tification of the first characterized EF-P target motif that lacks a 
proline: YIRYIR. Furthermore, our analysis of PoxB identified the 
motif GSCGPG as responsible for its regulation by EF-P. Cumu- 
latively these data expand the known repertoire of EF-P target 
motifs and demonstrate, in certain instances, that they may re- 
quire sequences longer than three amino acids. This is further 
reinforced by the fact that the tripeptide motifs can be relatively 
common in the proteome and yet the overall number of proteins 
affected strongly by the loss of EF-P is considerably smaller. In- 
deed, of the 100 tripeptide target motif-containing proteins iden- 
tified in SILAC, only 20 were significantly downregulated. The 
data suggest that tripeptide motifs, including polyproline tracts, 
may not always be solely sufficient to mediate a translational stall 
and may depend on upstream interactions within the exit tunnel 
as well. Conversely, some weaker tripeptide motifs, such as the 
GPG of PoxB, may instigate ribosomal stalls if sufficiently 
strengthened by upstream interactions. 

It is clear that not all stalls are created equal, and we note that 



previously reported peptides that stall during elongation function 
in wild-type circumstances where EF-P is present and (presum- 
ably) active. Multiple groups have reported on proteins that insti- 
gate a translational stall via not only a particular sequence in the 
vicinity of the PTC but also upstream residues in the nascent poly- 
peptide chain that interact with the ribosomal exit tunnel. These 
stalls may be dependent on external factors, such as the antibiotic 
erythromycin in the case of ermALl and tryptophan for TnaC, or 
maybe self-mediated, as for SecM and MifM (1, 3, 7, 30-32). The 
laboratory of Allen Buskirk identified a number of stall sequences 
containing polyproline tracts near the PTC that also require up- 
stream residues for efficient stalling (5, 9). Future work will delin- 
eate the specific structural features that enable certain nascent 
chain-induced stalls to be alleviated by EF-P, while others, as in 
the case of SecM or TnaC, require alternate factors to rescue the 
stall. Also unresolved is whether EF-P preferentially targets partic- 
ular subsets of proteins (e.g., nucleotide binding proteins) for the 
purpose of regulation or if this EF-P-dependent subset of proteins 
merely requires difficult-to-translate structural features, such as 
polyprolines, for functional purposes. 

MATERIALS AND METHODS 

Bacterial strains and plasmids. For SILAC, we employed an arginine and 
lysine auxotroph (AargH AlysA) derivative of Salmonella enterica serovar 
Typhimurium strain SL1344 utilized in an earlier study (33). Auxotrophy 
was confirmed by growth in minimal medium in the absence of lysine or 
arginine. The Aefp mutation from strain WN934 was transferred into the 
auxotrophic strain by transduction using the HT105/1 int-201 derivative 
of phage P22 (34) to generate strain WN1308. 

The E. coli strains used for motif verification were derivatives from the 
Keio knockout collection (35), including Ay/cm, Ayjek, ApoxA, and Aefp 
strains andBW25113 (wild type). Deletion of genes and removal of kana- 
mycin cassettes using FLP recombinase (36) was confirmed by PCR. Plas- 
mids used for motif verification were derivatives of pBAD30 (37) contain- 
ing a tandem fluorescent fusion cassette composed of green fluorescent 
protein (GFP) followed immediately by the mCherry Shine-Dalgarno se- 
quence and mCherry, both of which are optimized for synthesis in E. coli 
(40). This construct is designated pBAD30mw700 and served as the PCR 
template for construction of other reporters (see Table SI in the supple- 
mental material). Unless otherwise stated, the codons used for motif in- 
serts were optimized according to tRNA abundance in E. coli (41). 

Using the red-gam recombinase protocol described above, we gener- 
ated strain WN 1 405 by deleting the majority of the efp gene (base pairs 1 45 
to 424) from S. Typhimurium strain 14028s such that the new mutation 
would not influence the promoter of the upstream antiparallel yjeK gene. 
The mutant allele was transduced to a fresh strain background prior to 
experimentation. Deletions of efp were confirmed by PCR amplification 
using the primers WNp582 and WNp583 (see Table SI in the supplemen- 
tal material) . WN 1 405 and the previously used Aefp mutant, WN934 (42 ) , 
behave identically under all conditions tested (data not shown). 

Plasmids used for the PoxB translational fusion assay were generated 
by PCR amplification of the 5' UTR (28 bp upstream of the start codon) 
and full-length (excluding the stop codon) or C-terminally truncated 
poxB gene using S. Typhimurium strain 14028s genomic DNA as a tem- 
plate. These amplicons were inserted into the Nsil and Nhel sites of 
pXGlOsf (38) to generate translational fusions to "superfolder" GFP 
(sfGFP). The sfGFP is a variant with improved folding and fluorescence in 
E. coli (38, 39) and we found that it substantially improved the sensitivity 
of our assay compared to results with standard GFP variants. The plasmid 
employs the constitutively active PLtetO- 1 promoter and the tightly con- 
trolled pSClOl low-copy-number origin of replication to ensure minimal 
variation in transcript levels. Point mutations were generated by site- 
directed mutagenesis of the full-length PoxB construct. Plasmids were 
named according to the truncation site inpoxB (i.e., the PoxB 68 construct 
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is truncated after codon 68 of poxB) or the location of a specific point 
mutation in full-length poxB (e.g., PoxB P76L contains a proline-to- 
leucine mutation at codon 76). LacZ186, included as a control, consists of 
the first 186 codons of the E. colt lacZ gene inserted into the Nsil andNhel 
sites of the pXGlOsf plasmid, similar to the PoxB fusions. 

SILAC. For stable isotope labeling of amino acids in cell culture (SI- 
LAC), we supplemented morpholinepropanesulfonic acid (MOPS) min- 
imal medium with amino acids at the concentrations previously described 
(33) and used 0.2% (wt/vol) glycerol as a carbon source. For heavy- 
isotope-labeled samples, arginine and lysine were replaced with 13 C 6 -Arg 
and 2 H 4 -Lys isotopes at the equivalent molar concentration. WN1269 
(wild type) was grown in heavy arginine and lysine isotopes, and WN1308 
(Ae/p) was grown in light isotopes in this medium for 16 h to ensure 
complete labeling of all proteins. Strains were subsequently subcultured 
1:200 in the same media and grown to an optical density (600 nm) of 0.5 
(mid-log phase), at which point the cells were harvested by centrifugation. 
The pellets were subjected to lysis by heating to 99°C for 5 min in fresh 
lysis buffer (1% deoxycholate [DOC] in 50 mM ammonium bicarbonate 
[NH 4 HC0 3 ] at pH 8). Cell debris was removed by centrifugation at 
13,000 X g for 15 min, and the supernatant was frozen at — 80°C until 
used. 

Mass spectrometry and proteomic data analysis. For analysis of the 
isotope-labeled lysates, 30 jug (each) of protein from WN1269 (heavy) 
and WN1308 (light) were combined, fractionated into 12 pieces by gel 
electrophoresis, and in-gel trypsin digested, following a previously out- 
lined procedure (43). The resulting peptides were subjected to liquid 
chromatography coupled to tandem mass spectrometry using an Orbitrap 
XL mass spectrometer (Thermo Scientific) as described previously (44). 
Data analysis was conducted using the MaxQuant software program (45) 
to generate an average normalized heavy/light ratio over three biological 
replicates, and significance B values were calculated using Perseus soft- 
ware (46). To determine significance, we used the cutoff of a significance 
B score of less than 0.01 in at least one trial. This statistic measures signif- 
icance within a single trial even for proteins that were identified in only 
one the three biological replicates. When we assessed significance using 
the average of significance B scores across all three trials (excluding scores 
where the protein of interest was not identified in a given trial), we ob- 
tained similar percentages of significantly misregulated proteins contain- 
ing EF-P target motifs. For example, using an average significance B score 
of less than 0.05, we identified 107 significantly misregulated proteins. 
Sixty-one of these are greater than 2-fold downregulated in the efp mu- 
tant, and 25 of these 61 contain a PPP, PPG, or APP motif. We report all 
expression ratios and significance B scores in Table S2 in the supplemental 
material. 

DAVID analysis. Gene lists were generated based on the SILAC data 
and on the presence of EF-P target motifs in Salmonella proteins, as de- 
scribed in the text and the figure legends. Gene lists were uploaded to the 
Database for Annotation, Visualization and Integrated Discovery (DA- 
VID) online analysis software program, and we employed the functional 
annotation tools to determine overrepresented annotation groupings 
among our input genes (22). DAVID employs a one-tailed Fisher exact 
probability test to calculate P values for individual annotation groups, and 
cluster P values are generated as the geometric means of the P values of all 
constituent groups. Functional annotation clusters with P values less than 
0.05 are shown in figures. A full breakdown of clusters can be found in 
tables in the supplemental material. 

Bioinformatic identification of prominent tripeptide motifs. Out of 
the 1,517 Salmonella proteins identified by SILAC, 1,294 were retrieved 
from the Kyoto Encyclopedia of Genes and Genomes (KEGG) database 
(http://www.genome.jp/kegg/), parsed, and processed for further analysis 
of prominent tripeptide motifs using a customized Perl script. All possible 
combinations of 20 common amino acids in 3-digit length were enumer- 
ated and counted for their respective occurrences in three designated 
groups, namely, high 10%, middle 80%, and low 10%, ranked by the 
gene's WN1269/WN1308 SILAC ratio. 



Expected occurrences in each group were calculated by first determin- 
ing the rate at which each amino acid occurred in the retrieved database 
(e.g., Ala accounts for 10% of all retrieved amino acids) and then calcu- 
lating the accumulated probability of each specific tripeptide motif, as- 
suming all combinations are completely unbiased (e.g., Ala-Ala-Ala is 
expected to occur in [10%] 3 = 0.1% of all tripeptides inspected), finally 
multiplying this probability with the three-digit sliding windows in- 
spected in each group (e.g., if the high-10% group contains 55,650 sliding 
windows of three consecutive amino acids, the expected occurrence of 
Ala- Ala- Ala is then 55,650 X 0.1% = 55.65). 

The enrichment index of each tripeptide in a group was calculated as 
the ratio of actual-over-expected occurrences of that tripeptide in that 
group, normalized by the actual-over-expected occurrence of that tripep- 
tide in total (total, in this case, refers to the 1,294 sequences retrieved from 
the SILAC data): Enrichment group = (occurrence in one group/expecta- 
tion in that group)/(occurrence in all groups/expectation in all groups). 

The P value of each tripeptide was calculated using a chi-square test 
with Yates' correction as follows: 

. v (Occurrence — Expectation! — 0.5) 2 

Chi square = Zj ; 

Expectation 

Top candidate motifs most enriched in proteins downregulated in the 
efp mutant are screened with a P value of <0.0005 and an Enrichment low - 
est io% value of <1 and then ranked by Enrichment^^^, 10% . Similarly, 
top candidate motifs enriched in upregulated genes are screened by a 
P value <0.0005 and Enrichement higllestloo/o value of <1 and then ranked 
byEnrichment lowestl0% . 

Motif verification. Luria-Bertani (LB) broth and base M9 minimal 
salts medium were of standard composition (47). Media were supple- 
mented with 200 /ug/ml ampicillin when required. All cultures were incu- 
bated at 37°C. Overnight cultures of E. colt strains harboring pBAD30 
constructs in LB were diluted to an optical density at 600 nm (OD 600 ) of 
0.05 in M9 medium supplemented with 0.2% glycerol. After 2 h or when 
the OD 600 reached 0.1 to 0.15, the culture was supplemented with 0.2% 
arabinose to induce synthesis of the GFP and mCherry reporter proteins, 
and fluorescence was assessed using a spectrofluorimeter (Horiba) at des- 
ignated time points. Cells were analyzed for GFP using excitation at 
481 nm and emission at 507 nm and for mCherry with excitation at 
587 nm and emission at 610 nm. The background level with blank me- 
dium was subtracted, and the ratio of GFP fluorescence over that of 
mCherry was calculated. Reported values represent averages and standard 
deviations determined from three independent experimental replicates. 

PoxB-sfGFP translational fusion assay. Wild-type (S. Typhimurium 
strain 14028s) and isogenic Ae/p mutant (WN1405) strains carrying plas- 
mids bearing full-length or truncated poxB translational fusions to sfGFP 
were grown in MOPS minimal medium supplemented with 0.2% (wt/vol) 
glucose and 20 /ig/ml chloramphenicol. Growth was conducted for 16 h at 
37°C in a Tecan Infinite M200 microplate reader with constant aeration. 
OD 600 and GFP fluorescence readings (excitation and emission wave- 
lengths of 475 nm and 511 nm, respectively) were taken every 15 min. For 
both OD 600 and GFP, background values taken from no-cell controls were 
subtracted from all readings. For clarity, the wild-type/Ae/p fluorescence 
ratios are displayed as GFP fluorescence per OD 600 unit at 10 h of growth. 
Results obtained when cultures were measured at 6, 8, 12, and 16 h of 
growth were similar to those at 10 h. 

SUPPLEMENTAL MATERIAL 

Supplemental material for this article may be found at http://mbio.asm.org 
/lookup/suppl/doi: 10.11 28/mBio.OO 1 80- 1 3/-/DCSupplemental. 

Figure SI, PDF file, 0.1 MB. 

Figure S2, PDF file, 0.2 MB. 

Figure S3, PDF file, 0.1 MB. 

Figure S4, PDF file, 0.1 MB. 

Table SI, PDF file, 0.5 MB. 

Table S2, XLSX file, 0.2 MB. 
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